$\Upsilon$-DB: Managing scientific hypotheses as uncertain data
نویسندگان
چکیده
In view of the paradigm shift that makes science ever more data-driven, we consider deterministic scientific hypotheses as uncertain data. This vision comprises a probabilistic database (p-DB) design methodology for the systematic construction and management of U-relational hypothesis DBs, viz., Υ-DBs. It introduces hypothesis management as a promising new class of applications for p-DBs. We illustrate the potential of Υ-DB as a tool for deep predictive analytics.
منابع مشابه
Υ-DB: Managing scientific hypotheses as uncertain data
In view of the paradigm shift that makes science ever more data-driven, we consider deterministic scientific hypotheses as uncertain data. This vision comprises a probabilistic database (p-DB) design methodology for the systematic construction and management of U-relational hypothesis DBs, viz., Υ-DBs. It introduces hypothesis management as a promising new class of applications for p-DBs. We il...
متن کامل$Υ$-DB: A system for data-driven hypothesis management and analytics
The vision of Υ-DB introduces deterministic scientific hypotheses as a kind of uncertain and probabilistic data, and opens some key technical challenges for enabling data-driven hypothesis management and analytics. The Υ-DB system addresses those challenges throughout a design-by-synthesis pipeline that defines its architecture. It processes hypotheses from their XML-based extraction to encodin...
متن کاملManaging large-scale scientific hypotheses as uncertain and probabilistic data
of Thesis presented to LNCC/MCT in partial fulfillment of the requirements for the degree of Doctor of Sciences (D.Sc.) MANAGING LARGE-SCALE SCIENTIFIC HYPOTHESES AS UNCERTAIN AND PROBABILISTIC DATA Bernardo Gonçalves February 2015 Advisor: Fabio Porto, D.Sc. In view of the paradigm shift that makes science ever more data-driven, in this thesis we propose a synthesis method for encoding and man...
متن کاملManaging and Mining Uncertain Data Managing and Mining Uncertain Data
In recent years, uncertain data has become ubiquitous because of new technologies for collecting data which can only measure and collect the data in an imprecise way. Furthermore, many technologies such as privacy-preserving data mining create data which is inherently uncertain in nature. As a result there is a need for tools and techniques for mining and managing uncertain data. This chapter d...
متن کاملSemantics Representation of Probabilistic Data by Using Topk-Queries for Uncertain Data
Database systems for uncertain and probabilistic data promise to have many applications. Query processing on uncertain data occurs in the contexts of data warehousing, data integration, and of processing data extracted from the Web. Data cleaning can be fruitfully approached as a problem of reducing uncertainty in data and requires the management and processing of large amounts of uncertain dat...
متن کامل